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We present a search for electroweak production of single top quarks in the s-channel (pp-^tb+X) 
and t-channel {pp^tqb+X) modes. We have analyzed 230 pb~^ of data collected with the D0 
detector at the Fermilab Tevatron collider at a center-of-mass energy of ^/s — 1.96 TeV. Two 
separate analysis methods are used: neural networks and a cut-based analysis. No evidence for a 
single top quark signal is found. We set 95% confidence level upper limits on the production cross 
sections using Bayesian statistics, based on event counts and binned likelihoods formed from the 
neural network output. The limits from the neural network (cut-based) analysis are 6.4 pb (10.6 pb) 
in the s-channel and 5.0 pb (11.3 pb) in the i-channel. 

PACS numbers: 14.65.Ha; 12.15.Ji; 13.85.Qk 



I. INTRODUCTION 

The top quark, discovered in 1995 at the Fermilab 
Tevatron Colhder by the CDF and D0 collaborations [l|, 
is by far the heaviest elementary particle found to date. 
Its large mass and corresponding coupling strength to 
the Higgs boson of order unity suggest that the physics 
of electroweak symmetry breaking might be visible in the 
top quark sector. 

Top quarks are produced at the Tevatron mainly in 
top-antitop pairs through the strong interaction. This 
mode led to the discovery of the top quark and has been 
the only top quark production mode observed to date. 
The top quark decays predominantly to a M^ boson and 
a b quark, but little else is known experimentally about 
its electroweak interactions. 

All previous studies of the top quark electroweak in- 
teraction and the Wtb vertex have been done either in 
the low-energy regime using virtual top quarks (in stud- 
ies of b quark decays), or in the decay of real top quarks. 
Both of these types of studies presuppose the unitarity 
of the CKM matrix and are thus constrained to study- 
ing the standard model with three generations of quarks. 
This restriction can be overcome by exploring the pro- 
duction of single top quarks through electroweak inter- 
actions. This production mode is becoming accessible at 
the Tevatron and promises the first direct measurement 
of the electroweak coupling strength of the top quark as 
well as a first glimpse at possible top quark interactions 
beyond the standard model (SM). 



A. Physics with Single Top Quarks 

The study of single top quark production provides the 
possibility of investigating top quark related properties 
that cannot be measured in top quark pair production. 
The most relevant of these is a direct measurement of 
the CKM matrix element \Vtb\ from the single top quark 
production cross sections. This provides the only mea- 
surement of \Vtb\ without having to assume three quark 
generations or CKM matrix unitarity. Together with the 
other CKM matrix measurements [2|, we will be able to 
test the unitarity of the CKM matrix. 

Single top quarks are produced through a left-handed 
interaction. Therefore, they are expected to be highly 
polarized. Since the top quark decays before hadroniza- 
tion can occur, the spin correlations are retained in the 
final decay products. Hence, single top quark production 
offers an opportunity to observe the polarization and to 
test the corresponding SM predictions. 

Measurements of the charged-current couplings of the 
top quark probe any nonstandard structure of the cou- 
plings and can therefore provide hints of new physics. 
Any deviation in the {V-A) structure of the Wtb coupling 
would lead to a violation of the spin correlation proper- 
ties [2j. Furthermore, combining single top quark mea- 
surements with W helicity measurements in top quark 



decays provides the most stringent information on the 
Wtb coupling ^. 

Finally, rather than manifesting itself in a modified 
Wtb coupling, new physics could produce a single top 
quark final state through other processes. There are sev- 
eral models of new physics that would increase the single 
top quark production cross sections ja|. Thus, constraints 
on physics beyond the standard model are possible even 
before an actual observation of single top quark produc- 
tion. 



B. Single Top Quark Production 

There are three standard model modes of single top 
quark production at hadron colliders. Each of these 
modes may be characterized by the four-momentum 
squared Q^, the virtuality, of the participating W bo- 
son: 

• s-channel W boson exchange (Q^ > 0): This pro- 
cess, pp^tb+X, is referred to as "f6," which in- 
cludes both tb and tb (see Fig. QJ. 

• i-channel and u-channel W boson exchange (Q^ < 
0): This process, pp^tqb+X, has the largest cross 
section of the three. It includes the leading order di- 
agram (Fig.[2t) with a b quark from the proton sea 
in the initial state, and a second diagram (Fig. |2h) 
where an extra b quark appears in the final state 
explicitly. This latter mode is of order 0(as) in 
the strong coupling as, but nevertheless provides 
the largest contribution to the total cross section. 
Historically, i-channel production has also been re- 
ferred to as W^-gluon fusion, since the b quark in 
the final state arises from a gluon splitting to a bb 
pair. We refer to the i-channel process as "tqb," 
which includes tqb, iqb, tq, and iq. 
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Real W boson production (Q 
process, pp — > tW+X, a single top quark appears 
in association with a real W boson in the final state. 
This process has a negligible cross section at the 
Tevatron '3| and will not be addressed in this paper. 




FIG. 1: Feynman diagram for leading order s-channel single 
top quark production. 




characterized by a high transverse momentum (pt), cen- 
traUy produced, isolated lepton (e* or /i^) and missing 
transverse energy (^t), together with two or three jets. 
One of the jets comes from a high-p^ central b quark 
from the top quark decay. 

Figures |31 and El sh ows the transverse momenta and 
pseudorapidities 77 | l4| for the partons in our modeling of 
the s-channel and i-channel single top quark processes, 
after decay of the top quark and W boson. 
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FIG. 2: Representative Feynman diagrams for i-channel sin- 
gle top quark production. Shown is the (a) leading order and 
(b) the 0{aa) W-g\uon fusion diagram. 



The next-to-leading order (NLO) production rates at 
the Tevatron (-^5 = 1.96 TeV) for the s- and i-channel 
sin gle top quark modes have been calculated [a S H S 
[10, 11, 12] and the results for cross sections are shown in 
Table Q] The uncertainties include components from the 
choice of scale and the parton distribution functions, but 
not for the top quark mass. 
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TABLE I: Theoretically calculated total cross sections for sin- 
gle top quark production at a pp collider with -^s — 1.96 TeV, 
using mt = 175 GeV. 



Process 



Cross Section [pb] 



s-channel (tb) 
f-channel (tqb) 

tW production 



0.88 
1.98 



+0.07 



+0.23 



0.093 ± 0.024 



For comparison, the calculated top quark pair pro- 
duction cross section at the Tevatron at 1.96 TeV is 
6.77 ± 0.42 pb [13. This aheady makes it clear that 
it is more difficult to isolate the single top quark signal 
than the top quark pair signal. 

Under the assumption that all top quarks decay to a 
W boson and a b quark, and only using W boson decays 
to electron and muon final states, the final state signa- 
ture of a single top quark event detected in this analysis is 



FIG. 3: Distributions of transverse momenta (a) and pseudo- 
rapidity (b) for the final state partons in s-channel single top 
quark events. The histograms only include the final state of 
i, not t. 



The final state fermions from the top quark decay have 
relatively high transverse momenta and central rapidi- 
ties. Since the s-channel process involves the decay of 
a heavy virtual object, the b quark produced with the 
top quark is also at high transverse momentum and cen- 
tral pseudorapidity. By contrast, the light quark in the 
t-channel appears at lower transverse momentum and 
at more forward pseudorapidities because it is produced 
when an initial state parton emits a virtual W boson. 
The b quark from i-channel initial state radiation appears 
typically at very low pT and with large pseudorapidities 
and is thus often not reconstructed experimentally. 

Due to its electroweak nature, single top quark produc- 
tion results in a polarized final state top quark. It has 
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FIG. 4; Distributions of transverse momenta (a) and pseudo- 
rapidity (b) for the final state partons in i-channel single top 
quark events. The histograms only include the final state of 
t, not i. 



been shown [13 that the top quark spin follows the direc- 
tion of the down-type quark momentum in the top quark 
rest frame. This is the direction of the initial d quark 
for the s-channel and close to the direction of the final 
state d quark for the i-channel. The above result follows 
directly from the properties of the polarized top quark 
decays when single top quark production is considered 
as top quark decay going "backwards in time" [iq . 



C. Overview^ of the Backgrounds 

Searches for single top quark production are challeng- 
ing because of the very large backgrounds. The situation 
is significantly different from top pair production not just 
because of the smaller production rate, but more impor- 
tantly because of the smaller multiplicity of final state 
particles (leptons or jets). Single top quark events are 
typically less energetic (because there is only one heavy 
object), less spherical (because of the production mech- 
anism), and typically have two or three jets, not four as 
do ti events. 

Processes that can have the same single top quark 



experimental signature include in order of importance 
W^-|-jets, tt, multijet production, and some smaller con- 
tributions from Z-|-jets and diboson events. 

• W^+jets events form the dominant part of the back- 
ground. The cross section for W+2 jets produc- 
tion is over 1000 pb JJLilSj with Wbb contributing 
about 1%. 

• The second largest background is due to tt pro- 
duction. This process has a larger multiplicity of 
final state particles than single top quark events. 
However, when some of the jets or a lepton are not 
identified, the kinematics of the remaining particles 
are very similar to those of the signal. 

• Multijet events form a background in the electron 
channel when a jet is misidentified as an electron. 
The probability of such misidentification is rather 
small, but the >3 jet cross section is so large that 
the overall contribution is significant. 

Additionally, bb production contributes to the back- 
ground when one of the 6's decays semileptonically. 
This background in the electron channel is very 
small. In the muon channel, bb events form a back- 
ground when the muon is away from the jet axis or 
when the jet is not reconstructed. 

• Z/DrcU-Yan+jets production can mimic the single 
top quark signals if one of the leptons is misidenti- 
fed. 

• WW, WZ, and ZZ processes are the electroweak 
part of the VF-|-jets and Z-|-jets backgrounds, but 
with different kinematics. 

Single top quark events are kinematically and topolog- 
ically similar to VF-|-jets and ti events. Therefore, ex- 
tracting the signal from the backgrounds is challenging 
in a search for single top quark production. 



D. Status of Searches 

Both the CDF and D0 collaborations have previ- 
ously performed searches for single top quark produc- 
tion [1^ |23| • Recently, CDF performed a search using 
160 pb~^ of data and obtained upper limits of 13.6 pb 
(s-channel), 10.1 pb (i-channel), and 17.8 pb {s+t com- 
bined) at the 95% confidence level [21|. D0 has published 
a neural network search for single top quark production 
using 230 pb~^ of data ,22], which is described in more 
detail in this article. 



E. Outline of the Analysis 

We have performed a search for the electroweak pro- 
duction of single top quarks in the s-channel and t- 
channel production modes with the D0 detector at the 



Fermilab Tevatron collider. We consider lepton+jets in 
the final state, where the lepton is either an electron or 
a muon. 

To take advantage of the differences between s- and 
t-channel final state topologies, we differentiate the s- 
channel search from the i-channel search by requiring at 
least one untagged jet in the t-channel search. For both 
s-channel and i-channel searches, we separate the data 
into independent analysis sets based on the lepton flavor 
(e or fi) and the multiplicity of identified b quarks (one 
tagged jet or more than one). 

We use two different multivariate methods to extract 
the signal from the large backgrounds: a cut-based anal- 
ysis, first presented here, and an analysis based on neu- 
ral networks that was first presented in brief form in 
Ref. |22|- In the absence of any significant evidence for 
signal, we set upper limits at the 95% C.L. on the single 
top quark production cross sections. 

Finally, we present limit contours in a two-dimensional 
plane of the s-channel signal cross section versus the t- 
channcl signal cross section. 



F. Outline of the Paper 

This paper is organized as follows. SectionlHldescribes 
the D0 detector and the reconstruction of the final state 
objects. Section ITTll summarizes the triggers for the data 
samples used in the search and Section Hvl describes the 
selection requirements. Section Ivl explains the modeling 
of signals and backgrounds, and Section |^ presents the 
numbers of events passing all selections. Section lVlIl dis- 
cusses the most important variables that offer discrimina- 
tion between the signals and backgrounds, and provides 
details of the cut-based and the neural network analy- 
ses. Section IVIIII lists the systematic uncertainties in 
this measurement. Section llXl discusses the procedure for 
setting limits on the signal cross section using Bayesian 
statistics. The limits are presented in Section H and we 
summarize the results in Section IXII 



disks. The CFT has eight thin coaxial barrels, each sup- 
porting two doublets of overlapping scintillating fibers 
of 0.835 mm diameter, one doublet being parallel to the 
collision axis, and the other alternating by ±3° relative 
to the axis. Light signals are transferred via clear light 
fibers to solid-state photon counters (visible light photon 
counters, VLPCs) that have w 80% quantum efficiency. 

Central and forward preshower detectors arc located 
just outside of the superconducting coil (in front of the 
calorimetry). These are constructed of several layers of 
extruded triangular scintillator strips that are read out 
using wavelength-shifting fibers and VLPCs. The next 
layer of detection involves three liquid-argon/uranium 
calorimeters: a central section (CC) covering \r]\ up to 
~ 1, and two end calorimeters (EC) extending coverage 
to I77I « 4, all housed in separate cryostats 24] . In ad- 
dition to the preshower detectors, scintillators between 
the CC and EC cryostats provide sampling of developing 
showers for 1.1 < \r]\ < 1.4. 

A muon system resides beyond the calorimetry, and 
consists of a layer of tracking detectors and scintillation 
trigger counters before 1.8 T iron toroids, followed by 
two more similar layers after the toroids. Tracking for 
\r]\ < 1 relies on 10 cm wide drift tubes |2J|, while 1 cm 
mini drift tubes are used for 1 < jr^l < 2. 

The luminosity is obtained from the rate of inelastic 
collisions measured using plastic scintillator arrays lo- 
cated in front of the EC cryostats, covering 2.7 < jryj < 
4.4. 



B. Object Reconstruction 

Physics objects are reconstructed from the digital sig- 
nals recorded in each part of the detector. Particles can 
be identified by certain patterns and, when correlated 
with other objects in the same event, they provide the 
basis for understanding the physics that produced such 
signatures in the detector. 



II. THE D0 DETECTOR AND OBJECT 
RECONSTRUCTION 



1. Primary Vertex 



A. The D0 Detector 

The D0 detector [23 is shown in Figs. |S1 and and 
consists of several layered elements. The first is a mag- 
netic central-tracking system, which includes a silicon mi- 
crostrip tracker (SMT) and a central fiber tracker (CFT), 
both located within a 2 T superconducting solenoidal 
magnet. The SMT has « 800, 000 individual strips, with 
a typical pitch of 50 — 80 /J,m, and a design optimized 
for tracking and vertexing capability at pseudorapidities 
of \r]\ < 3.0. The system has a six-barrel longitudinal 
structure, each with a set of four layers arranged axially 
around the beam pipe, and interspersed with 16 radial 



The position of the hard scatter interaction is deter- 
mined at D0 by clustering tracks into seed vertices using 
a Kalman filter algorithm [23. The primary vertex is 
then selected using a probability function based on the 
Pt values of the tracks assigned to each vertex. The hard 
scatter vertex is distinguished from other soft interaction 
vertices by the higher average p^ of its tracks. In multijet 
data events, the position resolution of the primary ver- 
tex in the transverse plane (perpendicular to the beam 
pipe) is around 40 /im, convoluted with a typical beam 
spot size of around 30 /xm. For the longitudinal direction 
(along the beam pipe), the typical resolution is about 
1 cm. 
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FIG. 5: General view of the D0 detector. The proton beam travels from left to right and the antiproton beam from right to 
left in this figure. 



2. Electrons 

Electron candidates are initially identified as energy 
clusters in the central region of the electromagnetic 
calorimeter, |?7| < 1.1. We define two classes of electron 
candidates: loose and tight. Loose electrons are required 
to have the fraction of their total energy deposited in 
the electromagnetic (EM) calorimeter /em > 0.9 and a 
shower-shape chi-squared, based on seven variables that 
compare the values of the energy deposited in each layer 
of the electromagnetic calorimeter with average distri- 
butions from simulated electrons, to be xLi < "^^^ ^i" 
nally, loose electron candidates are also required to be 
isolated by measuring the total deposited energy and the 
energy from the EM calorimeter only around the electron 
track: ^Totai(i? < 0.4) < 1.15 x EEuiR < 0.2), where 
R = ^y{A4>p + {Ar])^ is the radius of a cone defined by 
the azimuthal angle (j) and the pseudorapidty r/. 

For an electron candidate to be included in the tight 
class, a track must be matched to the loose cluster within 
|At7| < 0.05 and |A0| < 0.05, and additionally pass a 
cut on a seven-variable likelihood built to separate real 
electrons from backgrounds. The following variables are 
used in the likelihood: (i) /em; (ii) xLv (iii) ^f?Vp^''''=^ 
transverse energy of the cluster divided by the transverse 



momentum of the matched track; (iv) x^ probability of 
the track match; (v) distance of closest approach between 
the track and the primary vertex in the transverse plane; 
(vi) iVtracks, the number of tracks inside a cone of i? < 
0.05 around the matched track; and (vii) J2pt of tracks 
in an R < 0.4 cone around the matched track. Tight 
electrons are obtained by applying a cut on the likelihood 
of £ > 0.85. The overall tight electron identification 
efficiency in data is around 75%. 

A comparison between the dielectron invariant mass 
distributions for Z — > ee simulated events and data shows 
that the position of the simulated Z boson peak is shifted 
from that in data, and that the electron energy resolution 
is better than in data. We apply small corrections to 
the identification efficiency and electromagnetic energy 
of simulated electrons and smear their energies to agree 
with data. 



3. Muons 

Muons are reconstructed in D0 up to \rj\ = 2 by first 
finding hits in all three layers of the muon spectrometers 
and requiring that the timing of these hits is consistent 
with the hard scatter, thus rejecting cosmic rays. Sec- 
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FIG. 6: Close view of the tracking systems. 



ondly, all muon candidates must be matched to a track 
in the central tracker. That central track must pass the 
following criteria: (i) chi-squared per degree of freedom 
less than 4; (ii) the distance of closest approach to the 
primary vertex in the transverse plane must be less than 
three standard deviations; and (iii) the distance in z be- 
tween the track and the primary vertex must be less than 
1 cm. 

As for electrons, we similarly define two classes: loose 
and tighi^ but this time based solely on the muon's iso- 
lation from other objects. A loose isolated muon must 
comply with i?(muon, jet) > 0.5, which is the distance 
between the muon and the jet axis. A tight isolated muon 
must be loose and additionally satisfy track-based and 
calorimeter-based criteria: Y^Z'''^'^ ^ Vt I VT{li)\ < 0.06 
where the sum is over tracks within a cone of i?(track, 

muon)< 0.5; and \J2'^'^ ^ Et/pt{ij)\ < 0.08 where the 
sum is over calorimeter cells within an anulus of 0.1 < 
_R(calorimeter cell, muon)< 0.4. The overall tight nuion 
identification efficiency in data is around 65%. 

Similarly to electrons in the simulation, we correct the 
energy scale for simulated muons and smear their energies 
to reproduce the data in Z — > /i/j,. 



4- Jets 

We reconstruct jets based on calorimeter cell energies, 
using the improved legacy cone algorithm 26] with radius 



R ~ 0.5. Noisy calorimeter cells are ignored in the re- 
construction algorithm by imposing the requirement that 
neighboring cells have signals above the noise level. 

Jet identification is based on a set of cuts to reject 
poor quality jets or noisy jets: (i) 0.05 < /em < 0.95; 
(ii) fraction of jet Et in the coarse hadronic calorimeter 
layers < 0.4; (iii) ratio of E't's of the most energetic cell 
to the second most energetic cell in the jet < 10; and (iv) 
smallest number of towers that make up 90% of the jet 
Et, "-90 > 1- 

Jet energy scale corrections are applied to convert jet 
energies from the reconstructed level into particle-level 
energies. The reconstructed fully-corrected energy of jets 
from the simulation of the detector performance does not 
exactly match that seen in data. Similar to electrons and 
muons, we smear jet energies by a small amount in the 
simulation to reproduce the resolution measured in data. 



5. Missing Energy 

We infer the transverse energy of the neutrino in the 
event as the opposite of the vector sum of all the energy 
deposited in the calorimeter. This calorimeter-only miss- 
ing transverse energy is then corrected with the jet energy 
scale, the electromagnetic scale, and the energy loss from 
isolated muons in the calorimeter and their momenta. 
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C. Identification of b-Quarlt Jets 

The presence of b quarks can be inferred from the long 
lifetime of B hadrons, which typically travel a few mil- 
limeters before hadronization. Thus 5-quark jets contain 
a displaced vertex inside a jet whereas light-quark jets 
do not. The Secondary Vertex Tagger (SVT), described 
below, makes use of this fact to identify, or tag, 6-quark 
jets by fitting tracks in the jet into a secondary vertex. 



1. Taggability 

Before the 6-quark tagging algorithm is applied to iden- 
tify displaced vertices in the jet, a set of cuts is applied 
to ensure a good quality jet and factor out detector ge- 
ometry effects. Thus the final probability to identify a 
&-quark jet is factored into two parts: a taggability part, 
or jet-quality-sensitive component, and a tagger part, 
or heavy-flavor-sensitive component. A taggable jet re- 
quires at least two tracks within a cone of i? = 0.5. At 
least one of these tracks must have px > 1-0 GeV, and 
additional tracks must have pt > 0.5 GeV. All tracks 
must have at least one SMT hit, an xy distance-of-closest- 
approach (DCA) of < 0.2 cm, and a z DCA of < 0.4 cm 
with respect to the primary vertex. The taggability is 
the number of taggable jets divided by the number of 
good jets. Only jets satisfying jet identification require- 
ments, with PT > 15 GeV (after jet energy corrections) 
and 1 77 1 < 2.5 are considered to be good for the definition 
of taggability. 

In simulated events, the taggability is higher than in 
data mainly due to a non-comprehensive description of 
the tracking detectors (dead detector elements, other in- 
efficiencies, noise, etc.) resulting in a higher tracking ef- 
ficiency (in particular within jets). Therefore, the Monte 
Carlo taggability must be calibrated to that observed in 
the data. A taggability-rate function is utilized to do this 
by parametrizing the taggability as a function of jet px 
and r). Thus, the taggability per jet is determined in data 
and applied to the Monte Carlo as: 

ptaggabie/ _ # taggable jcts in {pr, ij) bin 

^P^'"^' #jetsin(pT,r7)bin ' ^'> 

Central jets with momenta above 40 GeV have taggabil- 
ities of around 85%. For simulated jets the taggability is 
« 90%. 



2. Secondary Vertex Tagger 

The SVT algorithm is designed to reconstruct a dis- 
placed vertex inside a jet by fitting tracks that have 
a large impact parameter from the hard scatter ver- 
tex. A simple algorithm is applied to the tracks to re- 
move most Kg's, A's, and photon conversions. Tracks 
are then required to have at least two SMT hits. 



Pt > 1-0 GeV, transverse impact parameter significance 
(dca/o'd^a) greater than 3.5, and a track x^ > 10. A sim- 
ple cone jet-algorithm is used to cluster the tracks into 
track-jets, and then a Kalman filter algorithm is used 
to find vertices with the tracks in each track-jet. The 
distance between the primary vertex and the found sec- 
ondary vertex, the decay length L^y, and its error ctl^ 
are calculated taking into account the uncertainty on the 
primary vertex position. The decay length is a signed 
parameter, defined by the sign of the cosine of the angle 
between the vector from the primary vertex to the decay 
point and the total momentum of the tracks attached 
to the secondary vertex. If the decay length significance 
Lxy/cTL-, is more than 7, then the found vertex is con- 
sidered a tag. A calorimeter jet is considered tagged if 
the distance between the jet axis and the line joining the 
primary vertex and the secondary vertex is i? < 0.5 in 
77, (j) space. This set of cuts has been tuned to obtain a 
probability for a light quark mistag of 0.25%. Note that 
gluon jets are included in the light quark category. 

We estimate the b tagging efficiency in a dijet data 
sample. The heavy flavor content of the sample is en- 
hanced by requiring one of the jets to have a high-py 
muon relative to the jet axis. The SVT efficiency to tag 
the other jet can then be inferred. We estimate the c 
quark tagging efficiency from a Monte Carlo simulation. 
The mis-tagging rate, or how often a light-flavor jet (from 
u, d, s quarks or gluons) is identifled as a 6 jet, is also 
measured in a dijet data sample. We count the num- 
ber of found secondary vertices with Lxy/cTL^ < —7 and 
correct for the contribution of heavy-flavor jets in the 
sample and the presence of long-lived particles in light- 
flavor jets. The sign in the decay length measurement 
comes from the scalar product of the decay length vector 
and the unit vector defined by the Figure [7| shows the 
tagging efficiency as a function of jet pt for the different 
types of jets. 

To calculate the probability for a simulated jet to be 
tagged, a tag-rate function (TRF) derived from data is 
used similarly to the taggability parametrized in pT and 
77: 

ptagging/ # SVT tagged jets in (pr, j) bin 

# taggable jets in (pt, 7?) bin 

Separate functions are determined for 6-quark jets, c- 
quark jets, and light-quark jets, as in Fig. 

The TRFs are applied to the Monte Carlo samples in 
the following way. First, for each jet in the event (with 
Pt > 15 GeV and I77I < 3.4) a taggability-rate function 
is applied. Next, each jet's lineage is determined. If 
the jet contains a B meson within R < 0.5 of the jet 
axis it is labeled a 5-quark jet. li a D meson is within 
R < 0.5 of the jet axis, it is labeled a c-quark jet. If 
no B or D meson is found in the jet, the jet is labeled 
a light-quark jet. The probability determined from the 
appropriate TRF is then applied. The taggability and 
tagging probability are multiplied together to determine 
the probability of the simulated jet to be tagged. 
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FIG. 7: Measured 6-tagging efficiency (circles) and mis- 
tagging rate (triangles), and estimated c-tagging efficiency 
(solid line) as a function of jet pr- 



In data we apply the secondary vertex algorithm di- 
rectly and can identify which jet is tagged and which 
is not. The situation in simulated events is different; 
the TRFs return a probability (or weight) rather than 
a tagged/not-tagged answer per jet. Since many of the 
discriminant variables used later on in the analysis (see 
Sec. lVIlA^ need to know which jet was tagged, each pos- 
sible combination of tagged and untagged jets is consid- 
ered for every simulated event. Thus each event is used 
repeatedly in the analysis, considering each time a differ- 
ent jet as tagged. The probability of each combination 
is calculated using the tag rate functions, and combined 
with the overall event weight. The sum of the weights 
for all the possible combinations of each event is equal to 
the original probability for an event to have at least one 
tagged jet. 

The use of all permissible tagged jet combinations in 
each simulated event is a very powerful tool. It ensures 
that the kinematic distributions in histograms of tagged 
events have the correct shape, and it allows tagged jet in- 
formation to be used in variables for signal/background 
separation, since the final classifiers are trained with 
weighted events. 



that are segmented longitudinally into electromagnetic 
and hadronic sections. The level 1 electron trigger re- 
quires electrons to be above a certain threshold: Et = 
E sin > T where E is the energy deposited in the tower, 
6 is the angle between the beam and the trigger tower 
from the center of the detector, and T is the programmed 
threshold. The level 2 electron trigger uses a seed-based 
clustering algorithm that sums the energy deposited in 
two neighboring towers and has the ability to make a 
decision based on the threshold of the cluster, the elec- 
tromagnetic fraction, and isolation of the electron. The 
level 3 electron trigger uses a simple cone algorithm with 
R < 0.25 and requirements on the Et, the electromag- 
netic fraction, and the quality of the transverse shower 
shape. 

The level 1 jet trigger is similar to the electron trigger 
tower algorithm, but includes the energy deposited in the 
hadronic portion of the calorimeter. The level 2 jet trig- 
ger uses a seed-based clustering algorithm summing the 
energy deposition in a 5 x 5 tower array. The level 3 jet 
algorithm is similar to the level 3 electron algorithm, but 
does not include a requirement on the electromagnetic 
fraction or shower shape. 

The level 1 muon trigger examines hits from the muon 
wire chambers, muon scintillation counters, and tracks 
from the level 1 track trigger for patterns consistent with 
those coming from a muon. The level 2 muon trigger 
reconstructs muon tracks from both wire and scintillator 
elements in the muon system. It can impose requirements 
on the number of muons, the px and t] of the muons, and 
the overall quality of the muons. The level 3 muon trigger 
uses wire and scintillator hits to reconstruct tracks using 
segments inside and outside the toroid. 

The output of the first level of the trigger is used to 
limit the rate for accepted events to ~ 1.5 kHz. At the 
next trigger stage, with more refined information, the 
rate is reduced further to « 800 Hz. The third level 
of the trigger, with access to all the event information, 
reduces the output rate to w 50 Hz, which is written to 
tape. 

The data were acquired in the period between August 
2002 and March 2004. Tables IHl and UHl show the trig- 
gers used to collect the data for the electron plus jets 
(e-fjets) and muon plus jets (/i-t-jets) triggers and give 
the integrated luminosity for each trigger. 



III. TRIGGERS AND DATA SET 

The D0 trigger system is composed of three levels. 
The first level consists of hardware and firmware com- 
ponents, the second level uses information from the first 
level to construct simple physics objects, and the third 
level is software based and performs full event reconstruc- 
tion. 

The D0 calorimeter is used to trigger events based on 
the energy deposited in towers of size Ar/ x A0 = 0.2 x 0.2 



IV. EVENT SELECTION 

Event selection begins after all corrections have been 
applied to the data. These corrections include the jet 
energy and the EM energy calibrations. The primary 
vertex, Zvertex, for the event must be within the tracking 
fiducial region, |2;vertex| < 60 cm, which allows for a suf- 
ficient number of tracks, A'tracks > 3, associated with it 
to be properly reconstructed. 

As discussed in Sec. II CI the single top quark signature 
is characterized by one isolated high-p^ charged lepton. 
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TABLE 11: Trigger conditions at levels 1, 2, and 3 for the electron plus jets trigger. 





Level 1 
Condition 


Level 2 
Condition 


Level 3 
Condition 


Luminosity 


1 EM tower, Et > 10 GcV 
2 jot towers, Et > 5 GeV 


1 e, Et > 10 GeV, EM fraction > 0.85 
2 jets, Et > 10 GeV 


1 tight e, Et > 15 GeV 
2 jets, Et > 15 GeV 


19.4 pb"^ 


1 EM tower, Et > 10 GeV 
2 jet towers, Et > 5 GeV 


1 e. Et > 10 GeV, EM fraction > 0.85 
2 jets, Et > 10 GeV 


1 loose e, Et > 15 GeV 
2 jets, Et > 15 GeV 


91.2 pb"^ 


1 EM tower, Bt > 11 GeV 




1 tight e, Et > 15 GeV 
2 jets, Et > 20 GeV 


115.4 pb"-' 





TABLE IIL Trigger conditions at levels 1,2, and 3 for the muon plus jets trigger. 



Level 1 
Condition 


Level 2 
Condition 


Level 3 
Condition 


Luminosity 


1 M, 1 ') l< 2.0 
1 jet tower, Et > 5 GeV 


1 t2,\v \< 2.0 


1 jet, Et > 20 GeV 


113.7 pb"^ 


1 A", 1 1) l< 2.0 
1 jet tower, Et > 3 GeV 


1 fi, \n\< 2.0 

1 jet, Et > 10 GeV 


1 jet, Et > 25 GeV 


113.7 pb"^ 



^T, and two to four jets. We accept events with three 
or four jets in order to include contributions from extra 
gluons and quarks. The b jet from the single top quark 
decay tends to be more energetic than the other jets as- 
sociated with the event, so we require a higher Et for the 
leading jet. Table Hvl lists the requirements of the initial 
selection. 



tagged or double-tagged. Since the i-channel requires 
at least one untagged jet, there are no two-jet events in 
the double-tagged sample in the double-tagged i-channel 
search. 



SIGNAL AND BACKGROUND MODELING 



TABLE IV: Initial event selection requirements. 



Selection Cut 




e-fjets 




fi+jets 


tight e, Et > 15 GeV 


= 1 




=0 


tight /i, Et > 


15 GeV 


=0 




= 1 


Pt 






> 15 GeV 




A^jets 






2 < ATjets < 4 




St (jet) 






> 15 GeV 




I'^Oet)! 






<3.2 




ETiietl) 






> 25 GeV 




I'yOeti)! 






< 2.4 





In addition, we make a set of cuts that remove misre- 
constructed events, also known as "triangle cuts." If the 
transverse energy of an object is mismeasured, this tends 
to create false missing energy in a parallel or antiparallel 
direction. The triangle cuts remove these mismeasured 
events, which are difficult to model, but do not affect the 
signal appreciably because there is very small signal ac- 
ceptance in these kinematic regions. In Fig. |H1 we show 
the kinematic regions that are removed by the triangle 
cuts. 

From the selected jets in the event, at least one b- 
tagged jet must be found. For the t-channel analysis, 
at least one jet must be untagged. This requirement 
comes from the fact that one of the main features of the 
t-channel signal is that a light-quark jet exists in the final 
state. The events are then divided into subsets consisting 
of the number of tagged jets found in the event: single- 



In order to compare the observed event yield in data 
with our expectation, and to set limits on the single 
top quark production cross sections, we determine ac- 
ceptances and event yields for the single top quark sig- 
nals and the various SM background contributions. This 
estimation is based primarily on simulated samples for 
shapes of distributions, except for the multijet back- 
ground where we use data samples. The yield normal- 
ization is based on theoretical cross sections, except for 
the W^-l-jets and multijet backgrounds which are normal- 
ized to data. 



A. Acceptance and Yield for Simulated Samples 

The acceptance a for a particular simulated signal or 
background sample is calculated as: 



1 



^MC Z^' 



(3) 



where the sum is over simulated events that pass the 
selection cuts and is normalized to the total number of 
simulated events in the sample N^'~^. The event weight 
Wi is given by: 



Wi 



lepton ID jet ID 



X e 



trigger b tagging 



(4) 



and includes correction factors e to account for effects 
not modeled and for cuts not applied to the simulated 
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FIG. 8; Kinematic regions excluded in the e+jets and /i+jets analyses by the triangle cuts applied in the (A(j!)(object, _^t),^t) 
plane, where each object can be: the tight isolated electron or muon (a), and the leading and second leading jets (b). The 
shaded areas are excluded. 



samples. Trigger requirements are not made in the simu- 
lation (see Sec. lIII|) and the correction factors g/'^gcr ^^^ 
about 90%. Furthermore, we do not require b tagging 
in simulated events, and the correction factor e^ '^sgmg 
averages about 55% for s-channel events and about 40% 
for i-channel events. 

The yield estimate y is given by the product of ac- 
ceptance, integrated luminosity £, theory cross section 
^theory ^ and branching fraction B: 

y^axCx a*'>°°'y X B. (5) 

The branching fraction factor gives the fraction of events 
that result in the final state lepton of interest (e or fi). 
The yield includes a small contribution from W —* t 
decays where the t decays to e or /x. 



B. Single Top Quark Signals 

The CompHEP matrix element generator [23 has been 
used to model single top quark s-channel and i-channel 
signal events. We include not only the leading order 
Feynman diagrams in the event generation, but also the 
0{as) diagrams with real gluon radiation in order to 
reproduce NLO distributions. For the t-channel sam- 
ple, we include both the leading order diagram (Fig. [3 
(a)) and the VF-gluon fusion diagram (Fig. [21(b)) explic- 
itly, generating W^-gluon fusion events for the region of 
phase space where the 6 quark from gluon splitting has 
Prib) > 17 GeV and leading order events otherwise. 



C. tt Background 

Top quark pair production contributes as a background 
both in the lepton-|-jets and in the dilepton decay chan- 
nels. This background is modeled using ALPGEN [13, and 
the yields are normalized to the theory cross section (see 
Sec.im. 



D. WW and WZ Backgrounds 

The backgrounds from diboson production are mod- 
eled using ALPGEN, and the yields are normalized to the 
theory cross sections |2a |. 



E. Multijet and Vl^+jets Backgrounds 

The backgrounds from multijet (fake lepton) and 
M^-|-jets production are normalized to the data sample 
before b tagging [23 • We start from a data sample pass- 
ing all selection cuts including the loose lepton require- 
ments (see Sec. llllj|l . From that sample, we select a sub- 
set of events that also pass the tight lepton requirements. 
In addition, we determine the probabilities for real and 
fake leptons to pass the tight lepton requirement. These 
two probabilities together with the numbers of events in 
the two samples then allow us to calculate the number of 
real and fake lepton events in the W-l-jets and multijet 
background samples _30| . 

The shapes of the distributions for the multijet back- 
ground are modeled using a data sample that passes all 
selection cuts but fails the tight lepton identification re- 
quirements. The shapes of distributions for the VF-|-jets 
background are modeled using ALPGEN W^-|-2jets events. 



1. Multijet Background 

A part of the background comes from events in which 
jets are misidentified as isolated leptons. In the electron 
channel, this background is typically produced by jets 
that contain a tt'', which, together with a randomly asso- 
ciated track, is misreconstructed as an isolated electron 
since it decays to two photons. In the muon channel, this 
background is typically produced by heavy-flavor jets in 
which a muon from a semileptonic decay is misrecon- 
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structed as an isolated high-pT muon. 

The multijet background is estimated purely from 
data. We use multijet data samples that pass all event 
selection requirements, but fail the requirement on tight 
muon isolation or tight electron quality (see Sec. IIIB|I to 
determine the kinematic shape of distributions. These 
samples are normalized to the multijet background esti- 
mate in the data sample after event selection, but before 
requiring a b tag. 



2. W+Jets Background 

An example Feynman diagram for W+2 jet production 
is shown in Fig. This background is modeled from a 
simulated Wjj sample (j = u,d, s,c, g), which includes 
not just light-quark flavors but also c quarks (considered 
massless in this model). We use a separate sample for 
Wbb and explicitly exclude events with b quarks from the 
Wjj sample. The parton level samples were generated 
with ALPGEN. 




w 



FIG. 9: Representative Feynman diagram for Wbb produc- 
tion. 



Since the VF-|-jets background is normalized to data 
(after subtraction of the small ti and diboson content), 
it includes all sources of VF-|-jets events with a similar 
flavor composition, in particular Z-t-jets events where one 
of the leptons from the Z boson decay is not identified. 



VI. EVENT YIELDS 

The expected event yields for the various background 
contributions are calculated from both simulated samples 
and data. The expected event yield for the single top 
quark signal is calculated from simulated samples and 
normalized to the theoretical cross sections. 

The total background event yield y is given by the sum 
over all backgrounds: 



y = j:y^ 



(6) 



where each individual yield yi is given by Eq. \S[ for the 
various MC samples. 

Table shows the numbers of events for each of the 
signals, combinations of signals, backgrounds, and data, 
after event selection and b tagging. The background sum 
reproduces the data within uncertainties for all samples 
after 6 tagging. 

A summary of the yield estimates for the signal and 
backgrounds and the numbers of observed events in data 
after selection, including the systematic uncertainties as 



described in Sec. IVIIIl is shown in Table IVIl 

After b tagging, the W-|-jets background makes up 
around 60% of the total background model (48% Wjj, 
12% Wbb), the ti background is around 27% (21% lep- 
ton-l-jets, 6% dilepton), 10% is mainly multijet back- 
ground, and s-channel single top quark production pro- 
vides 3% in the i-channel search and vice versa. 



VII. EVENT ANALYSIS 

TablelVlshows that even after event selection and 6 tag- 
ging, the expected single top quark signal yield is small 
compared to the overwhelming backgrounds. Additional 
steps are necessary in order to separate the signal and 
background. In this section, we first present kinematic 
variables that allow us to separate the s-channel or t- 
channel single top quark signal from the backgrounds. 
We then describe a cut-based analysis and a neural net- 
works analysis that use these variables. 



Detector Simulation 



The parton-level samples for the single top quark sig- 
nals, tt, VF-|-jets, WW, and WZ backgrounds are pro- 
cessed with PYTHIA [sJl for hadronization and modeling 
of the underlying event, using the CTEQSl [32| parton dis- 
tribution functions. TAUOLA '331 is used for tau lepton 
decays and evtgen |3J| for B hadron decays. The gen- 
erated events are processed through a GEANT-based [35l | 
simulation of the D0 detector. 



A. Discriminating Variables 

In this section we introduce the variables that we found 
to be most effective in separating the single top quark 
signals from the backgrounds. The list of discriminating 
variables has been chosen based on an analysis of Feyn- 
man diagrams of signals and backgrounds 36 1 and on a 
study of single top quark production at NLO [ll|, |l^ . 

The variables fall into three categories: individual ob- 
ject kinematics, global event kinematics, and variables 
based on angular correlations. The list of variables is 
shown in Table ETH 
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TABLE V: Event yields after selection in the electron and muon channels. 









Electron Channel 






Muon Channel 








before tag 


= 1 tag 


>2 tags 
s-channel 


>2 tags 
t-channel 


before tag 


= ltag 


>2 tags 
s-channel 


>2 tags 
i-channel 


Signals 




















tb 




5 


2.3 


0.53 


— 


5 


2.2 


0.50 


— 


tqb 




12 


4.1 


— 


0.25 


11 


3.9 


— 


0.23 


Backgrounds 


















th 




— 


— 


— 


0.14 


— 


— 


— 


0.13 


tqb 




— 


— 


0.32 


— 


— 


— 


0.29 


— 


ii^^+jets 


59 


25.0 


6.12 


5.74 


58 


24.2 


5.78 


5.48 


tt~*U 




16 


6.8 


1.58 


0.74 


17 


7.2 


1.62 


0.75 


Wbb 




40 


15.1 


2.49 


0.59 


33 


12.7 


2.15 


0.55 


Wjj 




3,211 


68.2 


1.53 


0.66 


2,898 


62.9 


1.29 


0.68 


WW 




13 


0.7 


0.00 


0.00 


14 


0.7 


0.00 


0.00 


WZ 




4 


0.6 


0.11 


0.02 


5 


0.5 


0.09 


0.01 


Multijet 




478 


13.7 


0.31 


0.14 


256 


17.2 


0.20 


0.20 


Summed 


signals 


17 


6.4 


0.53 


0.25 


16 


6.0 


0.50 


0.23 


Summed 


backgrounds 


3,821 


130.1 


— 


— 


3,280 


125.4 


— 


— 


Summed 


backgrounds+tqfo 


3,833 


134.2 


12.47 


— 


3,291 


129.3 


11.43 


— 


Summed 


backgrounds+ib 


3,826 


132.4 


— 


8.03 


3,285 


127.6 


— 


7.80 


Data 




3,821 


134 


15 


11 


3,280 


118 


16 


8 



TABLE VI: Estimates for signal and background yields and 
the numbers of observed events in data after event selection 
for the electron and muon, single-tagged and double-tagged 
analysis sets combined. The W^-j-jets yields include the di- 
boson backgrounds. The total background for the s-channel 
(t-channel) search includes the tqb (tb) yield. The quoted yield 
uncertainties include systematic uncertainties taking into ac- 
count correlations between the different analysis channels and 
samples. 



Source 



s-channel search 



t-channel search 



tb 


5.5±1.2 


4.8±1.0 


tqb 


8.6 ±1.9 


8.5 ±1.9 


W-l-jets 


169.1 ±19.2 


163.9 ±17.8 


tt 


78.3 ±17.6 


75.9 ±17.0 


Muhijet 


31.4 ±3.3 


31.3 ±3.2 



Total background 
Observed events 



287.4 ±31.4 
283 



275.8 ±31.5 
271 



thus the leading 6-tagged jet is chosen to reconstruct the 
top quark. By contrast, in the s-channel there are two 
high-pT b quark jets in the final state, and a choice needs 
to be made between them. Furthermore, typically only 
one of the two is identified as a 6-tagged jet. We use the 
best-jet algorithm Tff| to identify this jet without using b 
tagging information. The best jet is defined as the jet in 
each event which gives, together with the reconstructed 
W boson, an invariant mass closest to 175 GeV. Jets that 
have not been identified by the b tagging algorithm are 
called "untagged" jets. 

Figures [Tnitoll4lshow all discriminating variables used 
in this analysis, comparing the single top quark signal dis- 
tributions to those of the background sum and the data. 
Good agreement between the data and the background 
model is seen in all cases. 



B. Cut-Based Analysis 



In order to get optimum separation between signal 
and background, the single top quark final state is re- 
constructed according to whether a variable is primarily 
used in the s-channel or the t-channel search. The W bo- 
son from the top quark decay is reconstructed from the 
isolated lepton and the missing transverse energy. The z- 
component of the neutrino momentum is calculated using 
a W boson mass constraint, choosing the solution with 
smaller |p^| from the two possible solutions. The candi- 
date top quark is reconstructed from this W boson and 
a jet. This jet is chosen to be either the leading 6-taggcd 
jet or the best jet. In the t-channel analysis, there is 
typically only one high-p^ b quark jet in the final state, 



This analysis takes the discriminating variables, 
chooses the best subsets, and finds the optimal points 
to cut on them in order to improve the expected cross 
section limits by increasing the signal to background ra- 
tio. 

Optimization of the cut positions is performed by us- 
ing the signal Monte Carlo events to seed the cut values 
scanned in the algorithm. The signal and background 
pass rates are determined for each cut point, an expected 
limit on the cross section is obtained from these, and the 
best result is used as the operating point of the analysis. 

The strategy is to look at the s- and t-channel pro- 
cesses separately to take full advantage of the kinemat- 
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FIG. 10: Comparison of signal, backgrounds, and data after selection and requiring at least one b-tagged jet for five individual 
object variables. Electron and muon channels are combined. The transverse momentum is shown for (a) the leading tagged 
jet; (b) the leading untagged jet; (c) the second untagged jet, for those events that contain at least two untagged jets; (d) the 
leading non-best jet; and (e) the second non-best jet. Signals are multiplied by ten. 
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FIG. 11: Comparison of signal, backgrounds, and data after selection and requiring at least one 6-tagged jet for five discrim- 
inating event kinematic variables. Electron and muon channels are combined. Shown are (a) the invariant mass of all final 
state objects, (b) the total transverse momentum of the leading two jets, (c) the transverse mass of the leading two jets, (d) 
the invariant mass of all jets, and (e) the total transverse energy of all jets. Signals are multiplied by ten. 
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FIG. 12: Comparison of signal, backgrounds, and data after selection and requiring at least one b-tagged jet for five discrim- 
inating event kinematic variables. Electron and muon channels are combined. Shown are (a) the transverse momentum of all 
jets except the leading tagged jet, (b) the invariant mass of all jets except the leading tagged jet, (c) the total energy of all 
jets except the leading tagged jet, (d) the total transverse energy of all jets except the leading tagged jet, and (e) the invariant 
mass of the top quark reconstructed from the reconstructed W boson and the leading tagged jet. Signals are multiplied by ten. 
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FIG. 13: Comparison of signal, backgrounds, and data after selection and requiring at least one b-tagged jet for four discrim- 
inating event kinematic variables. Electron and muon channels are combined. Shown are (a) the invariant mass of all jets 
except the best jet, (b) the total energy of all jets except the best jet, (c) the total transverse energy of all jets except the best 
jet, and (d) the invariant mass of the top quark reconstructed from the reconstructed W boson and the best jet. Signals are 
multiplied by ten. 
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TABLE VII: List of discriminating variables. A check mark in the final four columns indicates in which signal-background pair 
of the neural net analysis the variable is used. 



Variable 



Description 



gnal-Background Pairs 
tb tqb 

Wbb tt Wbb tt 



PT(jetltaggcd) 
PT(jetl untagged) 
PT(jet2u„tagged) 
PT(jetl„o„_bcst) 

PT(jet2„„„_bcst) 

VI 

PT(jetl,jet2) 
MT(jetl,jet2) 
Af(alljets) 
ffTCalljets) 

PT(alljets-jetltaggod) 
Af(alljets- jetlt^gg^d) 
-H'(alljets-jetltagggd) 
HT(alljets- jetlt^gg^d) 
M(H/,jetlt,gg,d) 
M(aUjets- jetbest) 
_ff(alljets-jetbj,st) 
^T(alljets- jetbc.t) 

'?(Jetluntaggod) X Qe 

A7e(jetl,jet2) 

cos(t:, jetlujjtg^gg^jjjtoptaggod 

cos(£, QfX2)topb„,t 

COs(alljetS,jetltaggcd)alljets 
COs(alljCtS,jct„„„_bcst)alljcts 



Individual object kinematics 

Transverse momentum of the leading tagged jet 
Transverse momentum of the leading untagged jet 
Transverse momentum of the second untagged jet 
Transverse momentum of the leading non-best jet 
Transverse momentum of the second non-best jet 

Global event kinematics 
Invariant mass of all final state objects 
Transverse momentum of the two leading jets 
Transverse mass of the two leading jets 
Invariant mass of all jets 
Sum of the transverse energies of all jets 

Transverse momentum of all jets excluding the leading tagged jet 
Invariant mass of all jets excluding the leading tagged jet 
Sum of the energies of all jets excluding the leading tagged jet 
Sum of the transverse energies of all jets excluding the leading tagged jet 
Invariant mass of the reconstructed top quark using the leading tagged jet 
Invariant mass of all jets excluding the best jet 
Sum of the energies of all jets excluding the best jet 
Sum of the transverse energies of all jets excluding the best jet 
Invariant mass of the reconstructed top quark using the best jet 

Angular variables 
Pseudorapidity of the leading untagged jet X lepton charge 
Angular separation between the leading two jets 
Top quark spin correlation in the optimal basis for the t-channel 
the top quark with the leading tagged jet 

Top quark spin correlation in the optimal basis for the s-channel 
the top quark with the best jet 

Cosine of the angle between the leading tagged jet and the alljets system in the 
alljets rest frame 

Cosine of the angle between the leading non-best jet and the alljets system in the 
alljets rest frame 



151: 



151: 





V 


v 


V 


— 




— 


— 


V 






v^ 


^ 







V 


V 


— 


— 




V 


— 


V 


V 




V 


— 


V 


— 




V 


— 


— 


— 




V 


V 


V 


V 




— 


— 


V 


— 




— 


V 


— 


^/ 




— 


v 


— 


ed jet 


— 


— 


— 


^/ 


ged jet 


V 


V 
V 

V 


V 


V 




— 


— 


— 




— 


y 


— 


— 




V 


— 


— 


— 




— 


— 


V 


V 




x/ 


— 


V 


— 


reconstructing 


— 


— 


V 


— 


reconstructing 


V 


— 


— 


— 



- - V V 

- V - - 



leal differences between the channels. For each channel, 
there are four orthogonal analyses: two leptons (e, ^) x 
number of tagged h jets (== 1, > 2). 

The most critical part of this analysis is to find the 
combination of variables and cuts that leads to the low- 
est expected cross section limit. We first look at single- 
variable cuts to determine which variables are most ef- 
fective in each channel. Once an ordered list of variables 
is found (ordered by their power to lower the expected 
limit) , sets of variables are formed starting with the best 
variable and consecutively including one-by-one the rest 
of the variables. For each set, the optimal cut position 
of each variable is recalculated. Finally, the variable set 
that gives the lowest expected limit is chosen. Table IVTllI 
shows the optimal variable sets and cuts found for each 
channel. Table IIXI shows the numbers of events and ex- 
pected background and signal yields after these cuts have 
been applied. 

A summary of the yield estimates for the signal and 
backgrounds and the numbers of observed events in data 
after the cut-based selection, including the systematic 
uncertainties as described in Sec. IVIIII is shown in Ta- 



ble El 

Ths s- and f-channel combined signal to background 
ratio improves from around 1/20 after the basic selection 
(Table IVlll to around 1/14 after these cuts have been 
applied. It is clear that more sophisticated separation 
techniques are needed to isolate the signal better from 
the large backgrounds. 



C. Neural Network Analysis 



A neural network is a multivariate statistical technique 
for separating signals from backgrounds. We use the 
MLPFIT 37] package to construct and implement the net- 
works. In order for a neural network to approach the 
maximal signal-background separation, some optimiza- 
tion is required. This occurs in three steps: 1) judicious 
choice of signal and background pairs, 2) selection of in- 
put variables, and 3) optimization of training parameters. 
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FIG. 14: Comparison of signal, backgrounds, and data after selection and requiring at least one &-tagged jet for six angular 
correlation variables. Electron and muon channels are combined. Shown are (a) the pseudorapidity of the leading untagged jet 
multiplied by the lepton charge; (b) the angular separation between the leading two jets; (c) the top quark spin correlation in 
the optimal basis for the f-channel; (d) the top quark spin correlation in the optimal basis for the s-channel; (e) the cosine of 
the angle between the leading tagged jet and alljets, in the alljets frame; and (f) the cosine of the angle between the leading 
non-best jet and alljets, in the alljets frame. Signals are multiplied by ten. 



TABLE VIII: The best set of variables and cuts for each analysis channel. 





s-channel 




t-channel 




Channel 


Variables 


Cuts 


Variables 


Cuts 


Electron 










=1 tag 


PT(jetltaggod) 


> 27 GeV 


//t (alljets) 


> 71 GeV 




M(alljets-jetltaggcd) 


< 70 GeV 


M(alljets) 


> 57 GeV 




VI 


> 196 GeV 


73 

il75-A/(IV,jetltaggcd)l 

PT (jetltaggcd) 


> 203 GeV 
< 57 GeV 

> 21 GeV 


>2 tags 


PT(jetltaggod) 


> 42 GeV 


PT (jetltaggcd) 


> 34 GeV 




M(alljets - jetltaggcd) 


< 98 GeV 


M(alljets - jetltaggcd) 


< 75 GeV 




//(alljets- jetbest) 


< 304 GeV 


//(alljets -jetltaggcd) 


< 504 GeV 




//(alljets — jetltagged) 


< 304 GeV 


//(alljets -jetbest) 


< 504 GeV 


Muon 










=1 tag 


PT (jetltaggcd) 


> 33 GeV 


i 175 -M(W^, jetltaggcd)! 


< 60 GeV 




M(alljets - jetltaggcd) 


< 74 GeV 


^ 


> 210 GeV 




//(alljets -jetbest) 


< 504 GeV 


M(alljets) 


> 70 GeV 




//(alljets -jetltagged) 


< 504 GeV 


//t (alljets) 


> 58 GeV 


>2 tags 


PT (jetltaggcd) 

M(alljets - jetltagged) 

//(alljets- jetbest) 
//(alljets- jetltaggcd) 


> 33 GeV 
< 74 GeV 

< 504 GeV 

< 504 GeV 


!l75-A/(W^,jetltaggcd)l 


< 213 GeV 
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TABLE IX: Event yields after the cut-based analysis selection. 









Electron Channel 






Muon Channel 








=lTag 


>2 Tags 


=lTag 


>2 Tags 






s-channel 


i-channel 


s-channel 


f-channel 


s-channel 


i-channel 


s-channel 


i-channel 


Signals 
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TABLE X: Estimates of backgrounds and signal yields and the 
number of observed events in data after the cut-based selec- 
tion for the electron and muon, =1 tag and >2 tags analyses 
combined. 



Source 


s-channel search 


t-channcl search 


tb 


4.5 ±1.0 


3.2±0.8 


tqb 


5.5±1.2 


7.0 ±1.6 


W+jets 


27.6 ±7.6 


55.9 ±12.3 


tt 


102.9 ±13.7 


72.6 ±9.7 


Multijet 


17.2 ±2.0 


17.0 ±2.0 


Total background 


153.1 ±24.5 


148.7 ±24.8 


Observed events 


152 


148 



1. Choice of Signal- Background Pairs 

We have chosen to create networks trained on single 
top quark signals against the two dominant backgrounds: 
VF+jets and tt. For VF+jets, we train using a Wbb Monte 
Carlo sample as this process best represents all M^+jets 
processes. For ti, we train on ii-^^+jets which is the 
dominant background as opposed to the dilcpton back- 
ground which is small. 



nation that produces the minimum testing error, which 
corresponds to the best signal-background separation. 

We use the same variables for the electron and muon 
channel. However, owing to different resolutions and 
pseudorapidity ranges, we train the networks separately 
for the two. 



3. Neural Network Training 

Each network is composed of three layers of nodes: 
input, hidden, and output. Testing and training event 
sets are created from simulated signal and background 
samples. We divide the input samples such that 60% 
of the events are used for training and the remaining 
40% for testing. Training is effected with weighted events 
and the logarithm of all nonangular variables. We use 
a technique called early stopping [Sg to determine the 
maximum number of epochs for training which prevents 
over-training. 

Each network is further tuned by varying the number 
of hidden nodes between 10 and 30 and then selecting the 
number of hidden nodes that returns the smallest testing 



2. Choice of Input Variables 



4- Neural Network Results 



We start from a set of discriminating variables that 
each show some signal-background separation as dis- 
cussed in Sec. IVII Al Based on this, we optimize the 
input variables for each network by training with differ- 
ent combinations of variables and choosing the combi- 



The above procedure produces eight unique networks: 
two signals (s-channel, i-channel) x two backgrounds 
{Wbb, tt— >.^+jets) X two lepton flavors (e, fi). 

Figures El and El show the output variable distribu- 
tions from the networks in the s-channel and i-channel 
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searches for electrons and muons. From the figures, it can 
be seen that these networks are highly efficient at sepa- 
rating the single top quark signal from the tt— s-^+jets 
background. Studies have shown that these networks are 
not as effective for the tt dilepton background, which is 
fortunately small. The s-channel and i-channel networks 
arc less efficient at separating the single top quark signal 
from the Wbb background as compared to it-^^+jets. In 
addition, we find these networks are equally effective in 
separating the Wjj and the misidentified lepton back- 
ground as compared to the Wbb background. It should 
be noted that the output variable from mlpfit networks 
is not restricted to lie between zero and one. 

Figures El and El show comparisons of the summed 
backgrounds to data for the s-channel and i-channel 
searches, for electrons, muons, single-tagged, and double- 
tagged samples combined. These distributions show that 
the background model reproduces the data very well. 
From the figures, it can be seen that the tt-^£-|-jets filters 
do indeed separate the tt background which clusters near 
zero, but does not affect the M^-l-jets and multijet back- 
grounds, which cluster near one. Similarly, the Wbb fil- 
ters discriminate the H^-l-jets and multijet backgrounds, 
which cluster to the left of 0.5, but do not affect the tt 
background, which clusters to the right of 0.5. They also 
show that separation of the single top quark signal from 
background is not yet powerful enough since the back- 
ground dominates even in the regions where the signal 
peaks. 

Figure El shows the output of the t6-tt network versus 
the tb-Wbb network, and similarly for the tqb networks, 
again for electrons, muons, single-tagged, and double- 
tagged events combined. 



VIII. SYSTEMATIC UNCERTAINTIES 

We consider several sources of systematic uncertainties 
in this analysis, and study them separately for each signal 
and background source. Some of the uncertainties affect 
acceptance for simulated signals and backgrounds, oth- 
ers only affect background yield estimates. This section 
lists the uncertainties for each signal and background and 
their correlations. 

We consider the following sources of systematic uncer- 
tainty: 

• The 5-tag modeling uncertainty includes compo- 
nents for the estimation of the b tagging efficiency 
in data for the various quark fiavors, see Sec. Ill ("I 

• The jet energy calibration uncertainty reflects how 
well jet energies measured in the simulation reflect 
jet energies measured in data, and includes jet en- 
ergy scale uncertainty as well as modeling of jet 
energy resolution in the simulation, see Sec. IIIBI 

• The trigger modeling uncertainty includes compo- 
nents for the estimation of the efficiency of the var- 
ious trigger requirements in data, see Sec. lIIII 



The jet fragmentation uncertainty covers the uncer- 
tainty in modeling of initial- and final-state radia- 
tion as well as the difference in the fragmentation 
model between pythia and herwig 



ragi] 



• The uncertainty on the correction factor for simu- 
lated samples to account for the jet identification 
efficiency as described in Sec. IIIBI 

• The uncertainty on the correction factor for lepton 
identification efficiency in simulated samples as de- 
scribed in Sec. IIIBI 



• The cross section and branching fraction uncertain- 
ties from the yield normalization of simulated back- 
grounds. 

• The uncertainty on the normalization of the multi- 
jet and H^-|-jets background yields to the data. 

• The uncertainty on the integrated luminosity mea- 
surement. 

The uncertainty on the multijet background normaliza- 
tion includes two components: the estimate of the rate 
to misidentify a jet as an isolated lepton in the data, and 
the 6-tagging probability in the multijet data sample. 

The uncertainty for the Wjj and Wbb backgrounds 
includes several components: the normalization of the 
T4^+jets background to data before b tagging, the b- 
tagging probability estimate, and the fraction of Wbb 
events in the M^^-l-jets sample. Owing to the normaliza- 
tion to data, the Wbb and Wjj tagged yield estimates 
are not affected by any of the systematic uncertainties 
that affect the other simulated samples. The exception 
to this is b tagging, which is applied after normalization. 
There is still an effect on the shape of the Wjj and Wbb 
distributions from uncertainty components that vary bin- 
by-bin. 

Table IXII shows the systematic uncertainty values for 
each signal and background component. The range is 
given for the different analysis channels, electron and 
muon as well as single tags and double tags. 

Note that the W^-l-jets background includes small con- 
tributions from WW and WZ, whose uncertainties are 
also included in the limit setting calculation. Further- 
more, the normalization for Wbb and Wjj accounts for 
the other simulated backgrounds and thus their uncer- 
tainties in principle also affect Wbb and Wjj. However, 
the other simulated backgrounds only contribute about 
3% to the pretagged yield, which means their uncertain- 
ties are negligible compared to the overall normalization 
uncertainties. 



IX. CROSS SECTION LIMITS 

We use a Bayesian approach [331 to calculate limits on 
the cross section for single top quark production in the 
s-channel and f-channel modes. The limits are derived 
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FIG. 15: Neural network outputs in the s-channel. This figure shows the signal-background separation for (a) the filter for 
Wbb in the electron channel, (b) the filter for ii-^^+jets in the electron channel, (c) the filter for Wbb in the muon channel, 
and (d) the filter for tt— i-^+jets in the muon channel where the background is the dashed-lined and the top quark signal is the 
solid line. All the curves are normalized to have equal area, so that the separation between signal and background can be best 
seen. 



TABLE XI: Range of relative systematic uncertainty values in percent for the various signal and background samples in the 
different analysis channels. 



tb 



tqb 



tt 



W+iets 



multijet 



Signal and background acceptance 



6-tag modeling 


5-20 


8-20 


6-20 


Jet energy calibration 


6-20 


6-15 


3-11 


Trigger modeling 


2-6 


2-6 


— 


Jet fragmentation 


5 


5 


7 


Jet identification 


1 - 13 


5-11 


1-4 


Lepton identification 


4 


4 


4 


Background normalization 








Theory cross sections 


16 


15 


— 


Normalization to data 


— 


— 


— 


Luminosity 


6.5 


6.5 


6.5 



7-20 



5-16 



5-16 



from a likelihood function that is proportional to the 
probability to obtain the number of observed events. In 
the cut-based analysis, we count the total number of ob- 
served events, and in the neural network analysis, we use 
the two-dimensional distributions of the tt versus Wbb 
network outputs. 



A. Bayesian Approach 

We assume that the probability to observe a count D, if 
the mean count is d, is given by the Poisson distribution: 



p{D\d) 



jD 



T{D + 1) ' 



(7) 



where T is the gamma function. The mean count d is a 
sum of the predicted contributions from the signal and 
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FIG. 16: Neural network outputs in the f-channel. This figure shows the signal-background separation for (a) the filter for VFfefe 
in the electron channel, (b) the filter for if-^f+jets in the electron channel, (c) the filter for W^66 in the muon channel, and (d) 
the filter for tt-^^+jets in the muon channel where the background is the dashed-lined and the top quark signal is the solid 
line. All the curves are normalized to have equal area, so that the separation between signal and background can be best seen. 
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FIG. 17: Comparison of signal, background, and data for the neural network outputs in the s-channel, for the electron and 
muon channels combined, requiring at least one 6-tag. This figure shows (a) the tt filter and (b) the Whb filter. Signals are 
multiplied by ten. 



background sources: 



AT 



i=l 



AT 



(i = a £ CT + >, &i = acr + N^ ^i , 



(8) 



where a is the signal acceptance, C the integrated lu- 
minosity, a the signal cross section (the quantity of in- 



terest), bi the mean count for background source i, and 
a = a £ is the effective luminosity for the signal. For 
the s-channel (t-channel) search, the background bi in- 
cludes the t-channel (s-channel) process. The likelihood 
function L{D\d) is proportional to p{D\d). 

For two or more independent channels, we simply re- 
place the single channel likelihood by a product of likeli- 
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FIG. 18: Comparison of signal, background, and data for the neural network outputs in the t-channel, for the electron and 
muon channels combined, requiring at least one 6-tag. This figure shows (a) the tt filter and (b) the Wbb filter. Signals are 
multiplied by ten. 
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FIG. 19: Neural network outputs for both the tt versus Wbb filters in the (a) s-channel (b) and t-channel analyses. The 
background sum is shown as the shaded area, the signal as contour lines, and the data as stars. 



hoods: 



M 



L(D|d) = L(D|a,a,b) = JJ L{a,\d, 



where D and d, respectively, represent vectors of the ob- 
served counts and the mean counts for the sources of 
signal and background, in the M different channels. In 
addition, given K bins of any distribution, we calculate 
the likelihood for each channel as the product of the in- 
dividual likelihoods in each bin: 



K 



L{D,\di) = Y[ L{a, 



ijl'^ij) ■ 



which is true if the probability to observe a count in a 
given bin is independent of the count in other bins. 
(q\ We use Bayes' theorem to compute the posterior prob- 

ability density of the parameters, p(cr, a, b|D), which is 
then integrated with respect to the parameters a and b to 
obtain the posterior density for the signal cross section, 
given the observed distribution of counts D: 

p{a\T>) = — /L(D|cr,a,b)7r(cr,a,b)dadb. (11) 

Here Af is an overall normalization obtained from the 
requirement J p{(7\I})da = 1, and 7r(<T, a, b) is the prior 
(10) probability that encodes what we know about the param- 
eters a, a and b. We assume that any prior knowledge 
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of a and b is independent of the cross section cr, in which 
case we may write the prior density as 



7r(cr, a, b) — 7r(a, b|(T)7r(cr) 
= 7r(a, b) 7r(cr) . 



(12) 



We use a flat prior for a: 7r(cr) — l/cTmax, where a^^x is 
any sufhciently high upper bound on the cross section. 
The posterior probabihty density for the signal cross sec- 
tion is therefore 

p{cr\T>) = ^11 L(D|cr, a, b)7r(a, b) dadh . (13) 

The Bayesian upper hmit ctul at confidence level /3 is the 
solution of 



/ p{a\T>)da ^f3. 
Jo 



(14) 



The integral in Eq. El is done numerically using Monte 
Carlo importance sampling: we generate a large number 
K of randomly sampled points (afc,bfc) that represents 
the prior density 7r(a, b), and estimate the posterior using 



/ / L(D|cr,a,b)7r(a,b)dadb = — ^ L(D|cr,afc,b 



(15) 



B. Definition of the Prior Probability 



The prior 7r(a, b) encodes our knowledge of the effec- 
tive signal luminosities and the background yields: we 
have estimates of the parameters and the associated un- 
certainties from the different systematic effects discussed 
in Sec. IVlllI In the case of the cut-based analysis, since 
we consider the total yield for any source of signal or 
background, the different uncertainties affect the over- 
all normalization only. In the neural network analysis, 
since we consider distributions, we separate the uncer- 
tainties into two classes: those that alter only the overall 
normalization, such as the luminosity measurement and 
theory cross sections; and those that also alter the shapes 
of distributions, such as the trigger modeling, jet energy 
calibration, jet energy resolution, jet identification, and 
6-tag modeling. 

The normalization effects are modeled by sampling the 
effective signal luminosities a and the background yields 
b from a multivariate Gaussian, with a vector of means 
given by the estimates of the yields, and covariance ma- 
trix computed from the associated uncertainties. The 
covariance matrix takes into account the correlations of 
the systematic uncertainties across the different sources 
of signal and background. Each entry in the covariance 
matrix is calculated as follows: 



ViVj X! •^^'^•^J'^ ' 



where yi (jjj) is the yield for the i {j ) source of back- 
ground or signal from Table IVl and fik is the correspond- 
ing fractional uncertainty from the k^^ component of sys- 
tematic uncertainty, for the i'^ source. 

The shape effects are modeled by shifting, one by one, 
the trigger modeling, jet energy calibration, 5-tag model- 
ing, and so on, by plus or minus one standard deviation 
with respect to their nominal values. For each system- 
atic effect, we have three distributions: the nominal, and 
those from the plus and minus shifts. The systematic 
uncertainty in each bin is then sampled from a Gaussian 
distribution with mean defined by the nominal yield in 
that bin, and width defined by the plus and minus shifts. 
The sampled shifts are added linearly to the yields gener- 
ated from the sampling of the normalization-only system- 
atic uncertainties. We assume that any shape-changing 
systematic is 100% correlated across all bins and sources. 



X. RESULTS 

For both the s-channel and i-channel searches, we com- 
pute an observed limit as well as an expected limit. We 
define the latter as the limit obtained if the observed 
counts were equal to the background prediction. The 
different tag multiplicities (= 1 tag and > 2 tags) and 
lepton flavor (electron and muon) are combined as shown 
inEq.El 

The expected and observed upper limits at the 95% 
confidence level, after the initial event selection, and from 
the cut-based and neural network analyses, are shown in 
Table IXIll for the electron and muon channels combined, 
and with all systematic effects included. We see that 
the limits improve upon applying cuts on the discrim- 
inating variables, but that tighter limits are obtained 
when the variables are combined using our neural net- 
works method. The observed posterior probability den- 
sities as a function of the s-channel and f-channel cross 
sections are shown in Fig. [201 for the cut-based analysis 
and in Fig. l21l for the neural network analysis. 



TABLE XII: Expected and observed upper limits (in pico- 
barns) at the 95% confidence level, on the production cross 
sections of single top quarks in the s-channel (tb) and t- 
channel (tqb) searches, for the electron and muon channels 
combined, with all systematic effects included. 



Expected Limits 
tb tqb 



Observed Limits 
tb tqb 



(16) 



Initial selection 14.5 16.5 13.0 13.6 

Cut-based 9.8 12.4 10.6 11.3 

Neural networks 4.5 5.8 6.4 5.0 



The method described so far yields limits on the s- 
channel or t-channel cross sections separately. This re- 
quires some assumptions about whichever of the two sig- 
nal processes is not being considered. In this particu- 
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FIG. 20: The observed posterior probability density as a func- 
tion of the single top quark cross section for the cut-based 
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s-channel and the i-channel searches. 
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FIG. 21: The observed posterior probability density as a func- 
tion of the single top quark cross section for the neural net- 
work analysis, for the electron and muon channels combined 
in the s-channel and the i-channel searches. 



lar analysis, we have assumed that in the s-channel (t- 
channel) search, the t-channel (s-channel) contributes as 
a SM background. This assumption is, however, not nec- 
essary. Instead, we can set limits on both the s-channel 
and t-channel cross sections simultaneously. We accom- 
plish this by generalizing the likelihood so that it depends 
explicitly on the two cross sections CTs and at . Equation|Sl 
for the mean count d then becomes: 



Ps(t) = 



s(t) 



nt + J2i ^* 



(18) 



for the s-channel (t-channel) search, where n^ and n^ are 
the yields for the tb and tqb samples, respectively, and 
the sum in the denominator is over all the non-single top 
quark backgrounds in that bin. We then evaluate Pg and 
Pt simultaneously for each event and fill histograms of Ps 
versus Pt- As before, we consider a Poisson probability 
for the likelihood in each bin. We assume a flat prior 
in the plane of as versus at, which is equivalent to flat 
priors for either cross section. Equations 113! and 115! can 
then be used to define the posterior probability density 
for different values of the s- and t-channel cross sections. 
The limit at a fixed confidence level is then given by a 
contour of constant probability enclosing a fraction of 
volume corresponding to this confidence level using an 
equation analogous to Eg. 1141 but in two dimensions. 

Figure 1^ shows contours of observed posterior density 
in the ag versus at plane for the neural network analy- 
sis. To illustrate the sensitivity of this analysis to differ- 
ent contributions, the expected SM cross section as well 
as several representative non-SM contributions are also 
shown [3. 
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FIG. 22: Exclusion contours at the 68%, 90%, and 95% confi- 
dence levels on the observed posterior density distribution as 
a function of both the s-channel and f-channel cross sections 
in the neural networks analysis. Several representative non- 
standard model contributions from Ref. [g] are also shown. 



:£as + atCat 



A^' 



(17) 



The backgrounds hi now include only the non-single top 
quark sources. 

In order to exploit the sensitivity to both the s-channel 
and t-channel signals, we combine the output of the neu- 
ral networks in both searches. We calculate a signal prob- 
ability P in each bin of the histograms in Fig. ^| 



XI. SUMMARY 

We have analyzed electron-|-jet and muon-fjet events 
containing exactly one or more than one b jet, identi- 
fied with a secondary-vertex algorithm, and find no evi- 
dence for the electroweak production of single top quarks 
in 230 pb~^ of data collected by the D0 detector at 
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•^i = 1.96 TeV. The upper limits at the 95% confidence 
level on the cross section for s-channel and t-channel 
processes are 10.6 pb and 11.3 pb, respectively, using 
event counts in a cut-based analysis, and 6.4 pb and 
5.0 pb, respectively, using binned likelihoods in a neural 
network analysis. The neural network-base limits pre- 
sented here and in Ref. |22| are significantly more strin- 
gent than those previously published 19, 20, 21]. They 
are also close to the sensitivity required to probe models 
of physics beyond the standard model. 
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